Devlin, Jacob, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018.
“
BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding.” CoRR abs/1810.04805. http://arxiv.org/abs/1810.04805.
Jakubowski, Alexander, Milica Gasic, and Marcus Zibrowius. 2020.
“Topology of Word Embeddings: Singularities Reflect Polysemy.” In Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, 103–13. https://arxiv.org/abs/2011.09413.
Jurafsky, Daniel, and James H. Martin. 2009. Speech and Language Processing. MIT Press.
Mikolov, Tomás, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013.
“Efficient Estimation of Word Representations in Vector Space.” In 1st International Conference on Learning Representations,
ICLR 2013, Scottsdale, Arizona, USA, May 2-4, 2013, Workshop Track Proceedings, edited by Yoshua Bengio and Yann LeCun. http://arxiv.org/abs/1301.3781.
Mikolov, Tomás, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013.
“Distributed Representations of Words and Phrases and Their Compositionality.” CoRR abs/1310.4546. http://arxiv.org/abs/1310.4546.